Search | WHO COVID-19 Research Database

1.

Read2Tree: scalable and accurate phylogenetic trees from raw reads (preprint)

David Dylus; Adrian M Altenhoff; Sina Majidian; Fritz J Sedlazeck; Christophe Dessimoz.

biorxiv; 2022.

Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2022.04.18.488678

ABSTRACT

The inference of phylogenetic trees from raw sequencing reads is foundational to biology. However, state of the art phylogenomics requires running complex pipelines, at significant computational and labour costs, with additional constraints in sequencing coverage, assembly and annotation quality. To overcome these challenges, we present Read2tree, which directly processes raw sequencing reads into groups of corresponding genes. In a benchmark encompassing a broad variety of datasets, our assembly free approach was 10 to 100x faster than conventional approaches, and in most cases more accurate the exception being when sequencing coverage was high and reference species very distant. To illustrate the broad applicability of the tool, we reconstructed a yeast tree of life of 435 species spanning 590 million years of evolution. Applied to Coronaviridae samples, Read2Tree accurately classified highly diverse animal samples and near-identical SARSCoV2 sequences on a single tree - thereby exhibiting remarkable breadth and depth. The speed, accuracy, and versatility of Read2Tree enables comparative genomics at scale.

2.

Construction of a new chromosome-scale, long-read reference genome assembly of the Syrian hamster, Mesocricetus auratus (preprint)

R. Alan Harris; Muthuswamy Raveendran; Dustin T Lyfoung; Fritz J Sedlazeck; Medhat Mahmoud; Trent Prall; Julie Karl; Harshavardhan Doddapaneni; Qingchang Meng; Yi Han; Donna Muzny; Roger W. Wiseman; Jeffrey Rogers.

biorxiv; 2021.

Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2021.07.05.451071

ABSTRACT

Background The Syrian hamster (Mesocricetus auratus) has been suggested as a useful mammalian model for a variety of diseases and infections, including infection with respiratory viruses such as SARS-CoV-2. The MesAur1.0 genome assembly was published in 2013 using whole-genome shotgun sequencing with short-read sequence data. Current more advanced sequencing technologies and assembly methods now permit the generation of near-complete genome assemblies with higher quality and higher continuity. Findings Here, we report an improved assembly of the M. auratus genome (BCM_Maur_2.0) using Oxford Nanopore Technologies long-read sequencing to produce a chromosome-scale assembly. The total length of the new assembly is 2.46 Gbp, similar to the 2.50 Gbp length of a previous assembly of this genome, MesAur1.0. BCM_Maur_2.0 exhibits significantly improved continuity with a scaffold N50 that is 6.7 times greater than MesAur1.0. Furthermore, 21,616 protein coding genes and 10,459 noncoding genes were annotated in BCM_Maur_2.0 compared to 20,495 protein coding genes and 4,168 noncoding genes in MesAur1.0. This new assembly also improves the unresolved regions as measured by nucleotide ambiguities, where approximately 17.11% of bases in MesAur1.0 were unresolved compared to BCM_Maur_2.0 in which the number of unresolved bases is reduced to 3.00%. Conclusions Access to a more complete reference genome with improved accuracy and continuity will facilitate more detailed, comprehensive, and meaningful research results for a wide variety of future studies using Syrian hamsters as models.

3.

Oligonucleotide Capture Sequencing of the SARS-CoV-2 Genome and Subgenomic Fragments from COVID-19 Individuals (preprint)

harshavardhan doddapaneni; Sara Javornik Cregeen; Richard Sucgang; Qingchang Meng; Xiang Qin; Vasanthi Avadhanula; Hsu Chao; Vipin Menon; Erin Nicholson; David Henke; Felipe-Andres Piedra; Anubama Rajan; Zeineen Momin; Kavya Kottapalli; Kristi L. Hoffman; Fritz J. Sedlazeck; Ginger Metcalf; Pedro A. Piedra; Donna M. Muzny; Joseph F. Petrosino; Richard A. Gibbs; Stephanie Erbar; Ferdia Bates; Diana Schneider; Bernadette Jesionek; Bianca Saenger; Ann-Kathrin Wallisch; Yvonne Feuchter; Hanna Junginger; Stefanie A. Krumm; Andre P. Heinen; Petra Adams-Quack; Julia Schlereth; Stefan Schille; Christoph Kroener; Ramon de la Caridad Gueimil Garcia; Thomas Hiller; Leyla Fischer; Rani S. Sellers; Shambhunath Choudhary; Olga Gonzalez; Fulvia Vascotto; Matthew R. Gutman; Jane Fontenot; Shannan Hall-Ursone; Kathleen Brasky; Matthew C Griffor; Seungil Han; Andreas A.H. Su; Joshua Lees; Nicole L. Nedoma; Ellene H. Mashalidis; Parag V. Sahasrabudhe; Charles Y. Tan; Danka Pavliakova; Guy Singh; Camila Fontes-Garfias; Michael Pride; Ingrid L. Scully; Tara Ciolino; Jennifer Obregon; Michal Gazi; Ricardo Carrion Jr.; Kendra J. Alfson; Warren V. Kalina; Deepak Kaushal; Pei-Yong Shi; Thorsten Klamp; Corinna Rosenbaum; Andreas N. Kuhn; Oezlem Tuereci; Philip R. Dormitzer; Kathrin U. Jansen; Ugur Sahin.

biorxiv; 2020.

Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2020.12.11.421057

ABSTRACT

The newly emerged and rapidly spreading SARS-CoV-2 causes coronavirus disease 2019 (COVID-19). To facilitate a deeper understanding of the viral biology we developed a capture sequencing methodology to generate SARS-CoV-2 genomic and transcriptome sequences from infected patients. We utilized an oligonucleotide probe-set representing the full-length genome to obtain both genomic and transcriptome (subgenomic open reading frames [ORFs]) sequences from 45 SARS-CoV-2 clinical samples with varying viral titers. For samples with higher viral loads (cycle threshold value under 33, based on the CDC qPCR assay) complete genomes were generated. Analysis of junction reads revealed regions of differential transcriptional activity and provided evidence of expression of ORF10. Heterogeneous allelic frequencies along the 20kb ORF1ab gene suggested the presence of a defective interfering viral RNA species subpopulation in one sample. The associated workflow is straightforward, and hybridization-based capture offers an effective and scalable approach for sequencing SARS-CoV-2 from patient samples.

Subject(s)

COVID-19 , Infections

4.

Oligonucleotide capture sequencing of the SARS-CoV-2 genome and subgenomic fragments from COVID-19 individuals (preprint)

Harsha Vardhan Doddapaneni; Sara Javornik Cregeen; Richard Sucgang; Qingchang Meng; Xiang Qing; Vasanthi Avadhanula; Hsu Chao; Vipin Menon; Erin Nicholson; David Henke; Felipe-Andres Piedra; Anubama Rajan; Zeineen Momin; Kavya Kottapalli; Kristi L. Hoffman; Fritz J Sedlazeck; Ginger Metcalf; Pedro A Piedra; Donna M Muzny; Joseph F Petrosino; Richard A Gibbs.

biorxiv; 2020.

Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2020.07.27.223495

ABSTRACT

The newly emerged and rapidly spreading SARS-CoV-2 causes coronavirus disease 2019 (COVID-19). To facilitate a deeper understanding of the viral biology we developed a capture sequencing methodology to generate SARS-CoV-2 genomic and transcriptome sequences from infected patients. We utilized an oligonucleotide probe-set representing the full-length genome to obtain both genomic and transcriptome (subgenomic open reading frames [ORFs]) sequences from 45 SARS-CoV-2 clinical samples with varying viral titers. For samples with higher viral loads (cycle threshold value under 33, based on the CDC qPCR assay) complete genomes were generated. Analysis of junction reads revealed regions of differential transcriptional activity and provided evidence of expression of ORF10. Heterogeneous allelic frequencies along the 20kb ORF1ab gene suggested the presence of a defective interfering viral RNA species subpopulation in one sample. The associated workflow is straightforward, and hybridization-based capture offers an effective and scalable approach for sequencing SARS-CoV-2 from patient samples.

Subject(s)

COVID-19 , Infections

5.

Hidden genomic diversity of SARS-CoV-2: implications for qRT-PCR diagnostics and transmission (preprint)

Nicolae Sapoval; Medhat Mahmoud; Michael D. Jochum; Yunxi Liu; R.A. Leo Elworth; Qi Wang; Dreycey Albin; Huw Ogilvie; Michael D. Lee; Sonia Villapol; Kyle Hernandez; Irina Maljkovic Berry; Jon Foox; Afshin Beheshti; Krista Ternus; Kjersti M. Aagaard; David Posada; Christopher Mason; Fritz J Sedlazeck; Todd J Treangen.

biorxiv; 2020.

Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2020.07.02.184481

ABSTRACT

The COVID-19 pandemic has sparked an urgent need to uncover the underlying biology of this devastating disease. Though RNA viruses mutate more rapidly than DNA viruses, there are a relatively small number of single nucleotide polymorphisms (SNPs) that differentiate the main SARS-CoV-2 clades that have spread throughout the world. In this study, we investigated over 7,000 SARS-CoV-2 datasets to unveil both intrahost and interhost diversity. Our intrahost and interhost diversity analyses yielded three major observations. First, the mutational profile of SARS-CoV-2 highlights iSNV and SNP similarity, albeit with high variability in C>T changes. Second, iSNV and SNP patterns in SARS-CoV-2 are more similar to MERS-CoV than SARS-CoV-1. Third, a significant fraction of small indels fuel the genetic diversity of SARS-CoV-2. Altogether, our findings provide insight into SARS-CoV-2 genomic diversity, inform the design of detection tests, and highlight the potential of iSNVs for tracking the transmission of SARS-CoV-2.

Subject(s)

COVID-19

6.

Host, Viral, and Environmental Transcriptome Profiles of the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) (preprint)

Daniel J Butler; Christopher Mozsary; Cem Meydan; David C Danko; Jonathan Foox; Joel Rosiene; Alon Shaiber; Ebrahim Afshinnekoo; Matthew MacKay; Fritz J Sedlazeck; Nikolay A Ivanov; Maria A Sierra; Diana Pohle; Michael Zietz; Undina Gisladottir; Vijendra Ramlall; Craig D Westover; Krista Ryon; Benjamin Young; Chandrima Bhattacharya; Phyllis Ruggiero; Bradley W Langhorst; Nathan A Tanner; Justyna Gawrys; Dmitry Meleshko; Dong Xu; Peter A D Steel; Amos J Shemesh; Jenny Xiang; Jean Thierry-Mieg; Danielle Thierry-Mieg; Robert E Schwartz; Angelika Iftner; Daniela Bezdan; John Sipley; Lin Cong; Arryn Craney; Priya Velu; Ari Melnick; Iman Hajirasouliha; Stacy M Horner; Thomas Iftner; Mirella Salvatore; Massimo Loda; Lars F Westblade; Shawn Levy; Melissa Cushing; Shixiu Wu; Nicholas P Tatonetti; Marcin Imielinski; Hanna Rennert; Christopher Mason.

biorxiv; 2020.

Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2020.04.20.048066

ABSTRACT

The Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has caused thousands of deaths worldwide, including >18,000 in New York City (NYC) alone. The sudden emergence of this pandemic has highlighted a pressing clinical need for rapid, scalable diagnostics that can detect infection, interrogate strain evolution, and identify novel patient biomarkers. To address these challenges, we designed a fast (30-minute) colorimetric test (LAMP) for SARS-CoV-2 infection from naso/oropharyngeal swabs, plus a large-scale shotgun metatranscriptomics platform (total-RNA-seq) for host, bacterial, and viral profiling. We applied both technologies across 857 SARS-CoV-2 clinical specimens and 86 NYC subway samples, providing a broad molecular portrait of the COVID-19 NYC outbreak. Our results define new features of SARS-CoV-2 evolution, nominate a novel, NYC-enriched viral subclade, reveal specific host responses in interferon, ACE, hematological, and olfaction pathways, and examine risks associated with use of ACE inhibitors and angiotensin receptor blockers. Together, these findings have immediate applications to SARS-CoV-2 diagnostics, public health, and new therapeutic targets.

Subject(s)

COVID-19

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL